Overview
Brought to you by YData
Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 768 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 54.1 KiB |
| Average record size in memory | 72.2 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 1 |
Age is highly overall correlated with Pregnancies | High correlation |
Insulin is highly overall correlated with SkinThickness | High correlation |
Pregnancies is highly overall correlated with Age | High correlation |
SkinThickness is highly overall correlated with Insulin | High correlation |
Pregnancies has 111 (14.5%) zeros | Zeros |
BloodPressure has 38 (4.9%) zeros | Zeros |
SkinThickness has 227 (29.6%) zeros | Zeros |
Insulin has 374 (48.7%) zeros | Zeros |
BMI has 11 (1.4%) zeros | Zeros |
Age has 63 (8.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-10-22 17:45:24.384547 |
|---|---|
| Analysis finished | 2024-10-22 17:45:54.039939 |
| Duration | 29.66 seconds |
| Software version | ydata-profiling vv4.11.0 |
| Download configuration | config.json |
Variables
Pregnancies
Real number (ℝ)
High correlation  Zeros 
| Distinct | 15 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.28423997 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 111 |
| Zeros (%) | 14.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.074074074 |
| median | 0.22222222 |
| Q3 | 0.44444444 |
| 95-th percentile | 0.74074074 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.37037037 |
Descriptive statistics
| Standard deviation | 0.2477153 |
|---|---|
| Coefficient of variation (CV) | 0.8715006 |
| Kurtosis | -0.070852832 |
| Mean | 0.28423997 |
| Median Absolute Deviation (MAD) | 0.14814815 |
| Skewness | 0.85396175 |
| Sum | 218.2963 |
| Variance | 0.061362871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.07407407407 | 135 | |
| 0 | 111 | |
| 0.1481481481 | 103 | |
| 0.2222222222 | 75 | |
| 0.2962962963 | 68 | |
| 0.3703703704 | 57 | |
| 0.4444444444 | 50 | 6.5% |
| 0.5185185185 | 45 | 5.9% |
| 0.5925925926 | 38 | 4.9% |
| 0.6666666667 | 28 | 3.6% |
| Other values (5) | 58 |
| Value | Count | Frequency (%) |
| 0 | 111 | |
| 0.07407407407 | 135 | |
| 0.1481481481 | 103 | |
| 0.2222222222 | 75 | |
| 0.2962962963 | 68 | |
| 0.3703703704 | 57 | |
| 0.4444444444 | 50 | 6.5% |
| 0.5185185185 | 45 | 5.9% |
| 0.5925925926 | 38 | 4.9% |
| 0.6666666667 | 28 | 3.6% |
| Value | Count | Frequency (%) |
| 1 | 4 | 0.5% |
| 0.962962963 | 10 | 1.3% |
| 0.8888888889 | 9 | 1.2% |
| 0.8148148148 | 11 | 1.4% |
| 0.7407407407 | 24 | |
| 0.6666666667 | 28 | |
| 0.5925925926 | 38 | |
| 0.5185185185 | 45 | |
| 0.4444444444 | 50 | |
| 0.3703703704 | 57 |
Glucose
Real number (ℝ)
| Distinct | 136 |
|---|---|
| Distinct (%) | 17.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5189883 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 5 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.25868726 |
| Q1 | 0.38223938 |
| median | 0.49343629 |
| Q3 | 0.63706564 |
| 95-th percentile | 0.88880309 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.25482625 |
Descriptive statistics
| Standard deviation | 0.1926639 |
|---|---|
| Coefficient of variation (CV) | 0.37122975 |
| Kurtosis | -0.13216039 |
| Mean | 0.5189883 |
| Median Absolute Deviation (MAD) | 0.12355212 |
| Skewness | 0.41794622 |
| Sum | 398.58301 |
| Variance | 0.037119377 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.3822393822 | 17 | 2.2% |
| 0.3884169884 | 17 | 2.2% |
| 0.4563706564 | 14 | 1.8% |
| 0.5675675676 | 14 | 1.8% |
| 0.5428571429 | 14 | 1.8% |
| 0.4254826255 | 14 | 1.8% |
| 0.4625482625 | 13 | 1.7% |
| 0.4378378378 | 13 | 1.7% |
| 0.3575289575 | 13 | 1.7% |
| 0.4193050193 | 13 | 1.7% |
| Other values (126) | 626 |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 0.04247104247 | 1 | 0.1% |
| 0.1166023166 | 1 | 0.1% |
| 0.1227799228 | 2 | 0.3% |
| 0.1474903475 | 1 | 0.1% |
| 0.1536679537 | 1 | 0.1% |
| 0.1722007722 | 1 | 0.1% |
| 0.1845559846 | 1 | 0.1% |
| 0.1907335907 | 3 | |
| 0.2092664093 | 4 |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 0.9938223938 | 1 | 0.1% |
| 0.9876447876 | 4 | |
| 0.9814671815 | 3 | |
| 0.9752895753 | 2 | |
| 0.9691119691 | 3 | |
| 0.9629343629 | 2 | |
| 0.9505791506 | 1 | 0.1% |
| 0.9444015444 | 1 | 0.1% |
| 0.9382239382 | 4 |
BloodPressure
Real number (ℝ)
Zeros 
| Distinct | 42 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.49562355 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 38 |
| Zeros (%) | 4.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.051388889 |
| Q1 | 0.375 |
| median | 0.51388889 |
| Q3 | 0.625 |
| 95-th percentile | 0.76388889 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.25 |
Descriptive statistics
| Standard deviation | 0.19718388 |
|---|---|
| Coefficient of variation (CV) | 0.39785009 |
| Kurtosis | 0.62049494 |
| Mean | 0.49562355 |
| Median Absolute Deviation (MAD) | 0.11111111 |
| Skewness | -0.40603553 |
| Sum | 380.63889 |
| Variance | 0.038881481 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.4861111111 | 57 | 7.4% |
| 0.5416666667 | 52 | 6.8% |
| 0.4583333333 | 45 | 5.9% |
| 0.5972222222 | 45 | 5.9% |
| 0.5138888889 | 44 | 5.7% |
| 0.4027777778 | 43 | 5.6% |
| 0.625 | 40 | 5.2% |
| 0.5694444444 | 39 | 5.1% |
| 0 | 38 | 4.9% |
| 0.3472222222 | 37 | 4.8% |
| Other values (32) | 328 |
| Value | Count | Frequency (%) |
| 0 | 38 | |
| 0.04166666667 | 1 | 0.1% |
| 0.06944444444 | 1 | 0.1% |
| 0.125 | 4 | 0.5% |
| 0.1527777778 | 2 | 0.3% |
| 0.1805555556 | 5 | 0.7% |
| 0.2083333333 | 13 | 1.7% |
| 0.2361111111 | 11 | 1.4% |
| 0.2638888889 | 11 | 1.4% |
| 0.2777777778 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 7 | |
| 0.9861111111 | 3 | 0.4% |
| 0.9583333333 | 2 | 0.3% |
| 0.9305555556 | 1 | 0.1% |
| 0.9027777778 | 3 | 0.4% |
| 0.875 | 3 | 0.4% |
| 0.8472222222 | 4 | |
| 0.8333333333 | 1 | 0.1% |
| 0.8194444444 | 6 | |
| 0.7916666667 | 8 |
SkinThickness
Real number (ℝ)
High correlation  Zeros 
| Distinct | 51 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.25639648 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 227 |
| Zeros (%) | 29.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.2875 |
| Q3 | 0.4 |
| 95-th percentile | 0.55 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.1980593 |
|---|---|
| Coefficient of variation (CV) | 0.77247278 |
| Kurtosis | -0.98116285 |
| Mean | 0.25639648 |
| Median Absolute Deviation (MAD) | 0.15 |
| Skewness | 0.026662981 |
| Sum | 196.9125 |
| Variance | 0.039227488 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 227 | |
| 0.4 | 31 | 4.0% |
| 0.375 | 27 | 3.5% |
| 0.3375 | 23 | 3.0% |
| 0.2875 | 22 | 2.9% |
| 0.4125 | 20 | 2.6% |
| 0.35 | 20 | 2.6% |
| 0.225 | 20 | 2.6% |
| 0.3875 | 19 | 2.5% |
| 0.2375 | 18 | 2.3% |
| Other values (41) | 341 |
| Value | Count | Frequency (%) |
| 0 | 227 | |
| 0.0875 | 2 | 0.3% |
| 0.1 | 2 | 0.3% |
| 0.125 | 5 | 0.7% |
| 0.1375 | 6 | 0.8% |
| 0.15 | 7 | 0.9% |
| 0.1625 | 11 | 1.4% |
| 0.175 | 6 | 0.8% |
| 0.1875 | 14 | 1.8% |
| 0.2 | 6 | 0.8% |
| Value | Count | Frequency (%) |
| 1 | 1 | 0.1% |
| 0.7875 | 1 | 0.1% |
| 0.75 | 1 | 0.1% |
| 0.7 | 1 | 0.1% |
| 0.675 | 2 | |
| 0.65 | 2 | |
| 0.6375 | 1 | 0.1% |
| 0.625 | 3 | |
| 0.6125 | 3 | |
| 0.6 | 4 |
Insulin
Real number (ℝ)
High correlation  Zeros 
| Distinct | 157 |
|---|---|
| Distinct (%) | 20.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.23152116 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 374 |
| Zeros (%) | 48.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.095874263 |
| Q3 | 0.4 |
| 95-th percentile | 0.92102161 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.29414862 |
|---|---|
| Coefficient of variation (CV) | 1.2705042 |
| Kurtosis | 0.40783879 |
| Mean | 0.23152116 |
| Median Absolute Deviation (MAD) | 0.095874263 |
| Skewness | 1.1738981 |
| Sum | 177.80825 |
| Variance | 0.086523408 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 374 | |
| 1 | 34 | 4.4% |
| 0.3300589391 | 11 | 1.4% |
| 0.4086444008 | 9 | 1.2% |
| 0.4400785855 | 9 | 1.2% |
| 0.3772102161 | 8 | 1.0% |
| 0.295481336 | 7 | 0.9% |
| 0.3143418468 | 7 | 0.9% |
| 0.5658153242 | 7 | 0.9% |
| 0.3614931238 | 6 | 0.8% |
| Other values (147) | 296 |
| Value | Count | Frequency (%) |
| 0 | 374 | |
| 0.04400785855 | 1 | 0.1% |
| 0.04715127701 | 1 | 0.1% |
| 0.05029469548 | 1 | 0.1% |
| 0.05658153242 | 2 | 0.3% |
| 0.06915520629 | 1 | 0.1% |
| 0.07229862475 | 2 | 0.3% |
| 0.07858546169 | 1 | 0.1% |
| 0.09115913556 | 1 | 0.1% |
| 0.100589391 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 34 | |
| 0.9996070727 | 1 | 0.1% |
| 0.974459725 | 1 | 0.1% |
| 0.9555992141 | 1 | 0.1% |
| 0.9430255403 | 1 | 0.1% |
| 0.921021611 | 2 | 0.3% |
| 0.9147347741 | 1 | 0.1% |
| 0.8958742633 | 2 | 0.3% |
| 0.8927308448 | 1 | 0.1% |
| 0.8801571709 | 1 | 0.1% |
BMI
Real number (ℝ)
Zeros 
| Distinct | 242 |
|---|---|
| Distinct (%) | 31.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.50470605 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 11 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.22715054 |
| Q1 | 0.375 |
| median | 0.50134409 |
| Q3 | 0.625 |
| 95-th percentile | 0.83454301 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.25 |
Descriptive statistics
| Standard deviation | 0.18950494 |
|---|---|
| Coefficient of variation (CV) | 0.37547586 |
| Kurtosis | 0.050052852 |
| Mean | 0.50470605 |
| Median Absolute Deviation (MAD) | 0.12365591 |
| Skewness | 0.1358086 |
| Sum | 387.61425 |
| Variance | 0.035912121 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.501344086 | 13 | 1.7% |
| 0.4905913978 | 12 | 1.6% |
| 0.4798387097 | 12 | 1.6% |
| 0 | 11 | 1.4% |
| 0.5362903226 | 10 | 1.3% |
| 0.5120967742 | 10 | 1.3% |
| 0.4502688172 | 9 | 1.2% |
| 0.5228494624 | 9 | 1.2% |
| 0.5255376344 | 9 | 1.2% |
| 0.4690860215 | 9 | 1.2% |
| Other values (232) | 664 |
| Value | Count | Frequency (%) |
| 0 | 11 | |
| 0.1303763441 | 3 | 0.4% |
| 0.1357526882 | 1 | 0.1% |
| 0.1545698925 | 1 | 0.1% |
| 0.1599462366 | 1 | 0.1% |
| 0.1626344086 | 1 | 0.1% |
| 0.1653225806 | 2 | 0.3% |
| 0.1680107527 | 3 | 0.4% |
| 0.1760752688 | 1 | 0.1% |
| 0.1787634409 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 0.9852150538 | 1 | 0.1% |
| 0.9771505376 | 1 | 0.1% |
| 0.9744623656 | 1 | 0.1% |
| 0.9663978495 | 1 | 0.1% |
| 0.9529569892 | 1 | 0.1% |
| 0.939516129 | 1 | 0.1% |
| 0.9287634409 | 2 | 0.3% |
| 0.8991935484 | 2 | 0.3% |
| 0.8965053763 | 1 | 0.1% |
DiabetesPedigreeFunction
Real number (ℝ)
| Distinct | 517 |
|---|---|
| Distinct (%) | 67.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.16817946 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.026622545 |
| Q1 | 0.070772844 |
| median | 0.12574722 |
| Q3 | 0.23409479 |
| 95-th percentile | 0.45040564 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.16332195 |
Descriptive statistics
| Standard deviation | 0.1414725 |
|---|---|
| Coefficient of variation (CV) | 0.84119962 |
| Kurtosis | 5.5949535 |
| Mean | 0.16817946 |
| Median Absolute Deviation (MAD) | 0.071520068 |
| Skewness | 1.9199111 |
| Sum | 129.16183 |
| Variance | 0.020014468 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.07685738685 | 6 | 0.8% |
| 0.07514944492 | 6 | 0.8% |
| 0.08112724167 | 5 | 0.7% |
| 0.05508112724 | 5 | 0.7% |
| 0.0781383433 | 5 | 0.7% |
| 0.07728437233 | 5 | 0.7% |
| 0.0683176772 | 5 | 0.7% |
| 0.04782237404 | 4 | 0.5% |
| 0.07899231426 | 4 | 0.5% |
| 0.09436379163 | 4 | 0.5% |
| Other values (507) | 719 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.002561912895 | 1 | |
| 0.002988898377 | 2 | |
| 0.004269854825 | 2 | |
| 0.004696840307 | 1 | |
| 0.005977796755 | 1 | |
| 0.007685738685 | 1 | |
| 0.009393680615 | 1 | |
| 0.009820666097 | 1 | |
| 0.01024765158 | 1 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 0.9611443211 | 1 | |
| 0.9436379163 | 1 | |
| 0.8791631085 | 1 | |
| 0.7749786507 | 1 | |
| 0.7271562767 | 1 | |
| 0.7058070026 | 1 | |
| 0.6921434671 | 1 | |
| 0.6917164816 | 1 | |
| 0.6498719044 | 1 |
Age
Real number (ℝ)
High correlation  Zeros 
| Distinct | 47 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.26812901 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 63 |
| Zeros (%) | 8.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.065934066 |
| median | 0.17582418 |
| Q3 | 0.43956044 |
| 95-th percentile | 0.81318681 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.37362637 |
Descriptive statistics
| Standard deviation | 0.25556932 |
|---|---|
| Coefficient of variation (CV) | 0.95315804 |
| Kurtosis | 0.33096993 |
| Mean | 0.26812901 |
| Median Absolute Deviation (MAD) | 0.15384615 |
| Skewness | 1.0671703 |
| Sum | 205.92308 |
| Variance | 0.065315676 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.02197802198 | 72 | 9.4% |
| 0 | 63 | 8.2% |
| 0.08791208791 | 48 | 6.2% |
| 0.06593406593 | 46 | 6.0% |
| 0.04395604396 | 38 | 4.9% |
| 0.1538461538 | 35 | 4.6% |
| 0.1098901099 | 33 | 4.3% |
| 0.1318681319 | 32 | 4.2% |
| 0.1758241758 | 29 | 3.8% |
| 0.2197802198 | 24 | 3.1% |
| Other values (37) | 348 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 0.02197802198 | 72 | |
| 0.04395604396 | 38 | |
| 0.06593406593 | 46 | |
| 0.08791208791 | 48 | |
| 0.1098901099 | 33 | |
| 0.1318681319 | 32 | |
| 0.1538461538 | 35 | |
| 0.1758241758 | 29 | |
| 0.1978021978 | 21 | 2.7% |
| Value | Count | Frequency (%) |
| 1 | 9 | |
| 0.989010989 | 4 | |
| 0.967032967 | 3 | 0.4% |
| 0.9450549451 | 1 | 0.1% |
| 0.9230769231 | 4 | |
| 0.9010989011 | 4 | |
| 0.8791208791 | 2 | 0.3% |
| 0.8571428571 | 5 | |
| 0.8351648352 | 3 | 0.4% |
| 0.8131868132 | 7 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 1.0 |
| 4th row | 0.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 500 | |
| 1.0 | 268 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 500 | |
| 1.0 | 268 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1268 | |
| . | 768 | |
| 1 | 268 | 11.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2304 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1268 | |
| . | 768 | |
| 1 | 268 | 11.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2304 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1268 | |
| . | 768 | |
| 1 | 268 | 11.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2304 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1268 | |
| . | 768 | |
| 1 | 268 | 11.6% |
Interactions
Correlations
| Age | BMI | BloodPressure | DiabetesPedigreeFunction | Glucose | Insulin | Outcome | Pregnancies | SkinThickness | |
|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.131 | 0.351 | 0.043 | 0.285 | -0.114 | 0.329 | 0.607 | -0.067 |
| BMI | 0.131 | 1.000 | 0.292 | 0.141 | 0.231 | 0.192 | 0.317 | 0.000 | 0.444 |
| BloodPressure | 0.351 | 0.292 | 1.000 | 0.030 | 0.236 | -0.007 | 0.150 | 0.185 | 0.126 |
| DiabetesPedigreeFunction | 0.043 | 0.141 | 0.030 | 1.000 | 0.091 | 0.221 | 0.173 | -0.043 | 0.180 |
| Glucose | 0.285 | 0.231 | 0.236 | 0.091 | 1.000 | 0.213 | 0.484 | 0.131 | 0.060 |
| Insulin | -0.114 | 0.192 | -0.007 | 0.221 | 0.213 | 1.000 | 0.265 | -0.126 | 0.541 |
| Outcome | 0.329 | 0.317 | 0.150 | 0.173 | 0.484 | 0.265 | 1.000 | 0.248 | 0.207 |
| Pregnancies | 0.607 | 0.000 | 0.185 | -0.043 | 0.131 | -0.126 | 0.248 | 1.000 | -0.085 |
| SkinThickness | -0.067 | 0.444 | 0.126 | 0.180 | 0.060 | 0.541 | 0.207 | -0.085 | 1.000 |
Missing values
Sample
| Pregnancies | Glucose | BloodPressure | SkinThickness | Insulin | BMI | DiabetesPedigreeFunction | Age | Outcome | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.444444 | 0.684942 | 0.513889 | 0.4375 | 0.000000 | 0.544355 | 0.234415 | 0.637363 | 1.0 |
| 1 | 0.074074 | 0.295753 | 0.430556 | 0.3625 | 0.000000 | 0.356183 | 0.116567 | 0.219780 | 0.0 |
| 2 | 0.592593 | 0.901158 | 0.402778 | 0.0000 | 0.000000 | 0.267473 | 0.253629 | 0.241758 | 1.0 |
| 3 | 0.074074 | 0.320463 | 0.430556 | 0.2875 | 0.295481 | 0.396505 | 0.038002 | 0.000000 | 0.0 |
| 4 | 0.000000 | 0.616988 | 0.069444 | 0.4375 | 0.528094 | 0.799731 | 0.943638 | 0.263736 | 1.0 |
| 5 | 0.370370 | 0.487259 | 0.541667 | 0.0000 | 0.000000 | 0.329301 | 0.052519 | 0.197802 | 0.0 |
| 6 | 0.222222 | 0.252510 | 0.208333 | 0.4000 | 0.276621 | 0.474462 | 0.072588 | 0.109890 | 1.0 |
| 7 | 0.740741 | 0.481081 | 0.000000 | 0.0000 | 0.000000 | 0.590054 | 0.023911 | 0.175824 | 0.0 |
| 8 | 0.148148 | 0.987645 | 0.486111 | 0.5625 | 1.000000 | 0.461022 | 0.034159 | 0.703297 | 1.0 |
| 9 | 0.592593 | 0.542857 | 0.847222 | 0.0000 | 0.000000 | 0.000000 | 0.065756 | 0.725275 | 1.0 |
| Pregnancies | Glucose | BloodPressure | SkinThickness | Insulin | BMI | DiabetesPedigreeFunction | Age | Outcome | |
|---|---|---|---|---|---|---|---|---|---|
| 758 | 0.074074 | 0.425483 | 0.569444 | 0.0000 | 0.000000 | 0.649194 | 0.050811 | 0.109890 | 0.0 |
| 759 | 0.444444 | 0.944402 | 0.791667 | 0.0000 | 0.000000 | 0.595430 | 0.085397 | 0.989011 | 1.0 |
| 760 | 0.148148 | 0.314286 | 0.319444 | 0.3250 | 0.050295 | 0.404570 | 0.293766 | 0.021978 | 0.0 |
| 761 | 0.666667 | 0.820849 | 0.541667 | 0.3875 | 0.000000 | 0.823925 | 0.138770 | 0.483516 | 1.0 |
| 762 | 0.666667 | 0.320463 | 0.375000 | 0.0000 | 0.000000 | 0.245968 | 0.027327 | 0.263736 | 0.0 |
| 763 | 0.740741 | 0.394595 | 0.569444 | 0.6000 | 0.565815 | 0.525538 | 0.039710 | 0.923077 | 0.0 |
| 764 | 0.148148 | 0.524324 | 0.486111 | 0.3375 | 0.000000 | 0.630376 | 0.111870 | 0.131868 | 0.0 |
| 765 | 0.370370 | 0.518147 | 0.513889 | 0.2875 | 0.352063 | 0.345430 | 0.071307 | 0.197802 | 0.0 |
| 766 | 0.074074 | 0.549035 | 0.347222 | 0.0000 | 0.000000 | 0.450269 | 0.115713 | 0.571429 | 1.0 |
| 767 | 0.074074 | 0.345174 | 0.486111 | 0.3875 | 0.000000 | 0.458333 | 0.101196 | 0.043956 | 0.0 |